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The ALICE High-Level Trigger processes data online, to either select interesting (sub-) events, or to compress 
data efficiently by modeling techniques. Focusing on the main data source, the Time Projection Chamber, the 
architecure of the system and the current state of the tracking and compression methods are outlined. 



1. Introduction 

The ALICE experiment described in |H2l3j will 
investigate Pb-Pb collisions at a center of mass 
energy of about 5.5 TeV per nucleon pair and pp 
collisions at 14 TeV. Its detectors are optimized 
for heavy-ion reactions at an anticipated charged 
particle multiplicity of up to dN c h/dr] of 8000 in 
the central region [3]. 

The main central tracking detector, the Time 
Projection Chamber (TPC), is read out by about 
600 000 channels, producing at most a data size 
of 75 MB per event for central Pb-Pb under ex- 
treme assumption and of 2.5 MB for pp collisions. 
The estimated value of the maximum gate fre- 
quency is about 200-1000 Hz depending on event 
multiplicity 0. 

The overall event rate is limited by the Data 
Acquisition (DAQ) bandwidth to the permanent 
storage system of 1.25 GB/s. Without further 
reduction or compression the ALICE TPC can 
only take central Pb-Pb events at up to 20 Hz. 
Significantly higher rates are possible by either 
selecting interesting (sub-) events, or compress- 
ing data efficiently by modeling techniques; both 
requiring real-time analysis of the detector infor- 
mation with a latency of the order of a few ms. 
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To accomplish the pattern recognition tasks at an 
incoming date rate of 10-25 GB/s, a massive par- 
allel computing system, the High-Level Trigger 
(HLT) system, is being designed pj. 

The key requirement for the HLT system is the 
ability to perform the event analysis in real-time. 
Based on the extracted information, charge clus- 
ters and tracks, data reduction can be performed 
in different ways: 

• Trigger: Generation and application of a 
software trigger capable of selecting inter- 
esting events from the input data stream. 

• Select: Reduction in the size of the event 
data by selecting (sub-) events and/or re- 
gion of interest (Rol) . 

• Compression: Reduction in the size of the 
event data by compression techniques. 

As such the HLT system will enable the ALICE 
TPC detector to operate at a rate up to a few 
hundred Hz for heavy-ion and up to 1 kHz for pp 
collisions. 

2. Data Flow and HLT Architecture 

Figure shows a sketch of the architecture of 
the system adapted to the anticipated data flow 
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Figure 1. Data-flow architecture of the HLT sys- 
tem. The detector raw data is duplicated and 
received by the DAQ and HLT system. 



from the ALICE detectors, focusing on the TPC. 
The TPC consists of 36 sectors, each sector being 
divided into 6 sub-sectors. For each sub-sector 
the data is transferred via an optical fiber from 
the detector front-end into DAQ Read-Out Re- 
ceiver Cards (D-RORC). From there a copy of the 
data is sent into HLT Read-Out Receiver Cards 
(HLT-RORC) interfaced to the receiving nodes 
through the internal PCI bus. In addition to dif- 
ferent communication interfaces, the HLT-RORC 
provides a FPGA co-processor for the data in- 
tensive local tasks of the pattern recognition and 
enough external memory to store several dozen 
event fractions. The receiver nodes perform the 
local pre-processing task, cluster and track seed- 
ing on the sub-sector level using the FPGA co- 
processor. The next two levels of computing 
nodes exploit the local neighborhood: track seg- 
ment reconstruction on sector level. Finally all lo- 
cal results are collected from the other sectors (or 
in principle from the other ALICE sub-detectors) 
and combined on a global level: track segment 
merging and final track fitting. 

The computing farm is designed to be com- 
pletely fault tolerant avoiding single points of 
failure, except for the unique detector links. A 
generic communication framework has been de- 



veloped based on the publisher-subscriber princi- 
ple, which one allows to construct any hierarchy 
of communication processing elements |7I8| . 

3. Online Pattern Recognition 

In general, the HLT system has to reconstruct 
the complete event information online. Concern- 
ing the TPC and the other tracking devices, the 
particles ideally follow helical trajectories due to 
the solenoidal magnetic field of the L3 magnet 
surrounding these central detectors. 

For HLT tracking, we distinguish two different 
approaches: the "sequential feature extraction" 
and the "iterative feature extraction" |9llfl| . 

The sequential method approximates the clus- 
ter centroids using a fast "Cluster Finder". The 
centroids are used as input for the "Track Fol- 
lower" to determine the corresponding track pa- 
rameters. This approach is applicable for lower 
occupancy like pp and low multiplicity Pb-Pb 
collisions. However, at larger multiplicities ex- 
pected for Pb-Pb at LHC, clusters start to over- 
lap and deconvolution becomes necessary in order 
to achieve the desired tracking efficiencies. 

Whereas, the iterative approach first esti- 
mates potential tracks using a "Track Candidate 
Finder", which are then feeded to a "Cluster Fit- 
ter" to assign clusters to the tracks thereby de- 
convoluting overlapping clusters shared by differ- 
ent tracks candidates. In both final helix 
fit on the assigned clusters determines the track 
parameters. 

In order to reduce data shipping and communi- 
caton overhead within the HLT, as much as pos- 
sible of the local pattern recognition will be done 
using the FPGA co-processor [TT] . 

3.1. Sequential Tracking Approach 

The HLT tracking scheme for low multiplic- 
ity events has been adapted from the STAR L3 
trigger The cluster finder estimates cluster 
centroids by the weighted mean in pad-and-time 
direction. Overlapping clusters are split at local 
minima. The list of space points is then handed to 
the track follower, which at first forms neighbor- 
ing clusters into segments and then merges seg- 
ments into tracks. The tracking performance has 
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Figure 2. Tracking efficiency for dNck/dr) = 2000 
and a magnetic field strength of 0.4 T as a func- 
tion of pt compared for online (HLT) sequential 
and offline tracking scheme. 



Figure 3. Relative transverse momentum reso- 
lution for dN c h/dr] — 2000 and a magnetic field 
strength of 0.4 T as a function of pt compared 
for online (HLT) sequential and offline tracking 
scheme. 



been extensively studied and compared with the 
offline TPC reconstruction chain jfilflllOj . It turns 
out, that the method is well suited and within the 
anticipated time budget for charged particle mul- 
tiplicities of up to diVch/dr; = 2000. 

Concentrating on dN^/drj = 2000 and a mag- 
netic field of 0.4 T, Fig. El shows a comparison 
of the tracking efficiency as a function of pt for 
the HLT and offline tracking schemes. In both 
cases, the efficiency drops for low momentum 
tracks (pt < 0.5 GeV) due to the crossing of sec- 
tor boundaries. The integral efficiency is about 
90% and the fake track rate of the order of 1.5%. 
Fig. displays the momentum resolution as a 
function of pt for the same setup. On average, the 
HLT cluster finder takes about 15 ms, the tracker 
about 750 ms. 

3.2. Iterative Tracking Approach 

The simple sequential method may be applica- 
ble to dN ch /dr) = 4000, but ALICE is preparing 
for even higher multiplicities of up to dN^/dr] = 
8000. In that case, the occupancy in the TPC is 
of the order of 25% and deconvolution becomes 
mandatory in order to achieve the desired track- 
ing efficiencies. The cluster shape is dependent 
on the track parameters, and in particular on 
the crossing angles of tracks between the padrow 



and drift time. In order to properly deconvo- 
lute the overlapping clusters, knowledge of the 
track parameters that have produced the clusters 
is needed. Once track candidates are given, the 
clusters can be fit to their known shape, and the 
cluster centroid can be correctly reconstructed. 
If the fit fails, the track candidate will be disre- 
garded. 

Hough Transform 

The Hough Transform (HT) method could be 
suitable to estimate track candidates, as it can 
be applied directly on the raw ADC data. In 
its original form ^3] the HT determines a curve 
in parameter space for a signal corresponding to 
all possible tracks with a given parametric form 
to which it could possibly belong. The space is 
then discretized and entries are stored in a his- 
togram. For all entries exceeding a given thresh- 
old in the histogram the corresponding parame- 
ters are found. 

For simplification of the transformation the 
detector is divided into subvolumes in pseudo- 
rapidity. In addition, restricting the analysis to 
tracks originating from the vertex, a track in the 
r\- volume is characterized by two parameters: the 
emission angle with the beam axis tp and the 
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curvature n. The transformation is performed 
from (R,c/>)-space to (^,n)-space using the follow- 
ing equations: 
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The integral efficiency for "good" track candi- 
dates is above 90% for a full multiplicity event 
and a magnetic field of 0.2 T. This was esti- 
mated P] by dividing the number of verified track 
candidates divided with the number of findable 
tracks estimated by the offline analysis . The list 
of verified track candidates was obtained by lay- 
ing out a road in the raw data corresponding to 
the track parameters of the peak. If enough clus- 
ters were found along the road, the track candi- 
date was considered a track, if not the track can- 
didate was disregarded. However, depending on 
certain threshold parameters, the number of fake 
track candidates coming from spurious peaks in 
the parameter space is above 200%. That prob- 
lem of the HT method has to be overcome in order 
to determine valuable track candidates as input 
for the cluster fitting and deconvoluting proce- 
dure. 

Cluster Fitter 

The cluster fitting method were initially imple- 
mented for TPC data in the NA49 experiment. 
The Cluster Fitter (CF) fits a two-dimensional 
Gauss function with 5 parameters to the charge 
clusters: 

• the position in pad and time direction; 

• the widths in pad and time; 

• the amplitude of the distribution. 

The initial values of the fit parameters are pro- 
vided by the track parameters. The position in 
pad and time direction is obtained by calculat- 
ing the crossing point between the tracks and the 
two-dimensional padrow plane. The widths are 
obtained from the parameterisation of the clus- 
ter model as a function of the track parameters. 



During the minimisation procedure of the least 
square error, the widths are held fixed at their 
input values, while the position in pad and time 
are free to vary. 



TPC 
rawdata 



Hough 
Transform 



Add 
histograms 
(opt.) 



Find local 
maxima 



Find/ 
deconvolute 
clusters 



Track fit, 
Detector 
merging 



Figure 4. Flow diagram showing the pattern 
recognition scheme using the Hough Transform 
and the Cluster Deconvoluter/Fitter. 



A flow diagram of the complete chain of HT and 
CF is shown in Fig. 31 The HT is done locally 
on the receiver nodes. The data volume there- 
fore corresponds to 1/6 of a complete TPC sec- 
tor. Optionally the respective histograms within 
a complete sector can be added in order to im- 
prove the signal/noise ratio of the peaks in the 
parameter space. A simple over-threshold local- 
maxima finder processes the histograms and iden- 
tifies the peaks corresponding to track candidates. 
The list of track candidates is passed to the CF 
to deconvolute the clusters along the respective 
track roads. The obtained tracks and their as- 
signed clusters are finally fitted to obtain their 
track parameters. 

FigureG3shows the efficiency obtained using the 
HT and cluster fitting approach for dN c h/dr] = 
4000 and a magnetic field strength of 0.4 T as a 
function of pt- For comparison, the results for the 
sequential cluster finder/track follower approach 
and the offline tracking scheme are added. The 
efficiency loss with respect to the standard track- 
ing is significant. The reason is, that even for 
p t > 0.5 GeV the HT produces a large number 
of fake track candidates from structured back- 
grounds in the parameter space (as mentioned 
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Figure 5. Tracking efficiency of the Hough Trans- 
form and Cluster Fitter (HT+CF) as a function 
of p t > 0.5 GeV for dN ch /dr] = 4000 and a mag- 
netic field strength of 0.4 T. For comparison, the 
efficiency of offline and the HLT sequential ap- 
proach (CF+TF) is also shown. 



above). These fake tracks are filtered out during 
the cluster fitting procedure by removing tracks, 
which do not point to valid clusters. However, 
their presence causes the cluster fitting procedure 
to become unstable, if too many fake tracks are 
pointing to the same cluster. As a result, the clus- 
ter fit may be rejected on the basis of y 2 criteria, 
resulting in a failure to fit the cluster. 

In order to ensure the principal function of the 
CF, offline tracks were feeded in the cluster fit- 
ting procedure representing optimal track candi- 
dates. Here, even at dA^h/dr? = 8000 and a mag- 
netic field strength of 0.4 T, the cluster fitter was 
able to improve the relative p t resolution by 10% 
and the azimuthal resolution by 15%, showing 
that the deconvolution/fitting procedure is work- 
ing and even capable of improving the offline res- 
olution. 

4. Data modeling and Data compression 

Besides triggering and selecting (parts of) 
events, one of the mains tasks of the HLT sys- 
tem is to compress the event data efficiently with 
a minimal loss of physics information. In general, 
two schemes of data compression are considered: 



• Binary lossless data compression, al- 
lowing bit-by-bit reconstruction of the orig- 
inal data set; 



• Binary lossy data compression, not al- 
lowing bit-by-bit reconstruction of the orig- 
inal data, while however retaining all rele- 
vant physical information. 

Most state of the art compression techniques 
were studied on real NA49 and simulated TPC 
data and presented in detail in All methods 
roughly result in compression factors close to 2. 

However, the most effective data compression 
can be expected by cluster and track model- 
ing |9I10| . Here, the input to the compression 
algorithm is a lists of tracks and their correspond- 
ing clusters. For every assigned cluster, the clus- 
ter centroid deviation from the track model is 
calculated in both pad and time direction. Its 
size is quantized with respect to the given detec- 
tor resolution and represented by a fixed number 
of bits. The quantization steps have been set to 
0.5 mm for the pad direction and 0.8 mm for the 
time direction, which is compatible with the in- 
trinsic detector resolution. In addition the total 
charge of the cluster is stored. Since the cluster 
shape itself can be parameterised as a function 
of track parameters and detector specific param- 
eters, the cluster widths in pad and time are not 
stored for every cluster. During the decompres- 
sion step, the cluster centroids are restored, and 
the cluster shape is calculated based on the track 
parameters. 

The compression scheme has been applied to 
a simulated Pb-Pb event with a multiplicity of 
dAch/d?7 = 1000. The input tracks used for the 
compression are tracks reconstructed with the se- 
quential tracking approach. The clusters, which 
were not assigned to a track during the track 
finding step, were disregarded and not stored for 
further analysis. A relative size of 11% for the 
compressed data with respect to the original set 
is obtained, whereas having a 2% efficiency loss 
compared to the original data. 
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5. Conclusion 

Focusing on the TPC, the HLT system is de- 
signed to increase the readout and storage of rele- 
vant physics events by a factor of 10. The sequen- 
tial approach, cluster finding followed by track 
finding, is applicable for pp and low multiplicity 
Pb-Pb data up to dN ch /drj = 2000 with more 
than 90% efficiency and might be extensible for 
higher particle densities. The timing results show 
that the desired rate of 1 kHz for pp and 200 Hz 
for Pb-Pb will be achievable. For multiplicities 
of diV c h/d?7 > 4000 , the iterative approach is 
foreseen using a track candidate finder followed 
by cluster deconvolution using the cluster fitter. 
Here, the Circle Hough Transform as a possible 
method to find track candidates seems to fail, as 
its high number of fake track candidates confuses 
the otherwise well working cluster fitter. So far, 
the problem to find a quantitative much better 
track candidate finder has not been solved. 

By compressing the data using data modeling 
techniques, the results for low multiplicity events 
show that with a loss of tracking efficiency of 
about 2% a compression factor of up to 10% with 
respect to the original data sizes is achievable. 
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